Capacity  of  the 

Independent  Increment  Noise  Channel 


AD-A235  i 


Michael  R.  Frey 


Department  of  Statistics 
University  of  North  Carolina  at  Chapel  Hill 


LISS44 


30  October  1990 


OTIC 


ELECTE 
MAY1  3  1991 


E 


Abstract  -  A  communications  channel  model  with  additive,  independent  increment,  stochast¬ 
ically  continuous  noise  is  presented.  The  information  capacity  of  this  channel  is  obtained 
for  a  mean-square  constraint  imposed  on  the  "generalized"  intensity  of  the  channel  encoder. 
The  capacity  is  derived  by  exploiting  the  Levy  decomposition  of  the  noise  to  decompose 
the  channel  into  a  white  Gaussian  noise  channel  in  parallel  with  a  marked  Poisson  channel. 
The  information  capacity  is  then  obtained  using  results  for  these  latter  channels.  In  particu¬ 
lar,  recent  results  for  the  Poisson  channel  with  mean-square-constrained  encoder  intensity 
are  exploited.  Although  only  the  channel  without  feedback  is  explicitly  addressed,  our 
results  are  also  seen  to  be  applicable  in  the  presence  of  delayed  causal  feedback. 
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I.  INTRODUCTION 
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of  the  independent  increment  noise  channel  is  given  in  Figure  1.  In  this  model,  a  random  message  de-  .  t;Si.  ym  J 
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Figure  1.  Independent  increment  noise  channel  model. 


scribed  by  a  stochastic  process  0,,  le  [0,7]  is  encoded,  perhaps  using  channel  feedback,  and  the 
encoded  message  X,  =X,(Q,Y)  is  transmitted.  The  encoded  message  is  corrupted  in  transmission  by  the 
addition  of  an  independent  increment  noise  process  N,,  t  e  [0,7]  so  that  the  signal  observed  by  the 
receiver  is  Y,  =X,  +N,.  The  channel  model  is  presented  in  further  detail  in  the  next  section. 

Within  the  taxonomy  of  continuous-time  channel  models,  the  independent  increment  noise  chan¬ 
nel  occupies  a  unique  position.  First,  the  independent  increment  noise  channel  is  a  natural  point  of 
departure  for  the  study  of  more  general  continuous-time  channels  such  as  those  with  conditionally 
independent  increment  noise,  Markov  noise,  or  martingale  noise.  Independent  increment  noise  is  a  spe¬ 
cial  case  of  all  three  of  these  classes  of  noise  processes.  Second,  the  independent  increment  noise  chan¬ 
nel  is  the  continuous-time  analogue  of  the  discrete  memoryless  channel  [9],  Thus  one  finds  properties  of 
the  discrete  memoryless  channel  reflected  in  the  independent  increment  noise  channel.  For  instance,  the 
capacity  of  the  discrete  memorylcss  channel  is  not  increased  by  the  availability  of  fccdtrack  [2],  [17], 
The  same  result  holds  under  certain  conditions  for  the  independent  increment  noise  channel  [11]. 
Finally,  both  the  white  Gaussian  noise  channel  and  the  Poisson  channel  with  nonrandom  noise  intensity 
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belong  to  the  class  of  independent  increment  noise  channels.  Each  of  these  two  channel  models  is 
relevant  to  modem  communication  systems:  for  example,  the  Gaussian  model  for  satellite  systems  and 
the  Poisson  model  for  laser/optical  communications. 

In  this  initial  treatment  of  the  independent  increment  noise  channel,  we  derive  its  information 
capacity.  We  consider  only  independent  increment  noise  channels  with  L6vy  process  noise;  that  is,  sto¬ 
chastically  continuous,  independent  increment  noise.  The  L6vy  process  can  be  decomposed  into  a 
continuous-path  Gaussian  part  and  a  purely  discontinuous-path  marked  Poisson  part  [6].  The  existence 
of  this  decomposition  for  L6vy  noise  processes  allows  us  to  split  the  independent  inurnment  noise  chan¬ 
nel  into  Gaussian  and  marked  Poisson  parts.  Then  the  information  capacity  can  be  calculated  using 
results  for  the  capacity  of  the  Gaussian  and  marked  Poisson  channels.  Heretofore,  the  information  capa¬ 
city  of  the  marked  Poisson  channel  in  any  guise  was  unknown  (although  see  Davis’  treatment  [4]  of 
orthogonally  polarized  optical  channels.)  Therefore,  to  obtain  the  information  capacity  of  the  indepen¬ 
dent  increment  noise  channel,  some  attention  must  be  given  to  the  marked  Poisson  channel. 

A  difficult  part  of  formulating  the  capacity  problem  for  the  independent  increment  noise  channel 
is  the  imposition  of  a  suitable  constraint  on  the  channel  encoding  process.  A  suitable  constraint  should 
appeal  to  physical  intuition,  it  should  be  capable  of  separation  into  two  parts  corresponding  to  the  Ldvy 
decomposition  of  the  channel  noise,  and  its  two  parts  should  admit  finite  solutions  for  the  capacity  of 
the  corresponding  Gaussian  and  marked  Poisson  subchannels.  A  mean-square  constraint  on  a  "general¬ 
ized"  channel  intensity  meets  all  these  demands.  While  the  capacity  of  the  white  Gaussian  noise  chan¬ 
nel  with  mean-square  constraint  is  well-known,  such  is  not  the  case  for  the  Poisson  and  marked  Poisson 
channels.  Previous  work  on  the  Poisson  channel  is  limited  to  peak  and  average  constraints  [4],  (?].  Thus 
we  present  here  certain  results  for  the  capacities  of  the  mean-square-constrained  Poisson  and  marked 
Poisson  channels.  A  fuller  treatment  of  the  mcan-squarc-constraincd  Poisson  and  marked  Poisson  chan¬ 
nels  is  planned  for  a  separate  paper. 

The  remainder  of  this  paper  is  organized  into  three  sections.  The  first  section  describes  the  chan¬ 
nel  model  in  detail.  The  second  section  states  the  capacity  of  the  channel  and  outlines  the  steps  in  its 
derivation.  Proofs  of  results  used  in  this  derivation  arc  presented  in  the  last  section. 


II.  channel  model 


The  noise  in  the  independent  increment  noise  channel  is  modeled  by  a  Levy  process.  The  proper¬ 
ties  of  this  process  are  given  in  [6],  [7],  For  convenience  certain  of  these  properties  are  restated  here. 

The  Lfevy  Process 

A  stochastic  process.  0<r  <T,  is  an  independent  increment  process  if,  for  all  0<fo<G<  ... 
<tn<T,  the  quantities  (  are  mutually  independent  We  say  a  process  ^  is  sto¬ 

chastically  continuous  at  time  i0  if 

->0 

as  |r  —  r0|  — >  0.  If  E,  is  stochastically  continuous  over  the  whole  interval  [O.T]  (with  the  definition  of  sto¬ 
chastic  continuity  at  the  endpoints  of  the  interval  suitably  modified),  then  £  is  said  to  be  stochastically 
continuous.  Stochastically  continuous  processes  with  independent  increments  have  no  discontinuities  of 
the  second  kind  w.p.  1  [6,  p.  168].  Every  such  process  has  a  unique  version  which  is  ciidlag  (right- 
continuous  with  extant  left  limits).  This  version  of  a  stochastically  continuous,  independent  increment 
process  is  called  a  L6vy  process  [7].  Every  Levy  process  can  be  represented  in  terms  of  independent 
Gaussian  and  Poisson  processes  as  follows  [6,  Theorem  1,  p.271]: 

Let  ^(,  0<»  <T,  be  a  Levy  process.  Let  J  be  the  space  of  jumps  which  occur  at  the  discontinui¬ 
ties  in  the  paths  of  Zero  is  expressly  excluded  from  belonging  to  J\  jumps  of  size  zero  are  disal¬ 
lowed.  For  A  eZj,  define  y,04)  to  be  the  number  of  points  se  [0,/]  for  which  -^,_0g  A  -{0). 
Also  define  y,  (A )  =  y,  (A )  -  n,  (/l )  where  11,  (A  )  =  E  [y ,  (A )].  Then 

i=G,  +  f  xy,(a[x)  +  J  xy ,(dx)  (1) 

u  r>  i  u  fs  i 

where  G,  is  an  independent  increment  Gaussian  process,  y,  (A )  is  an  independent  increment  Poisson 
process  for  each  A  e  y,(A)  and  y,(6)  are  independent  for  AnB  =0,  and  G,  is  independent  ol 
y,(A)  for  every  A  eZj. 
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It  is  primarily  for  convenience  that  the  L6vy  process  is  emphasized  in  this  presentation  over  the 
more  general  independent  increment  process  which  is  not  necessarily  stochastically  continuous.  The 
general  independent  increment  process  has  a  unique  decomposition  [7]  in  terms  of  a  deterministic  part, 
a  stochastically  continuous  (L6vy)  part,  and  a  stochastically  discontinuous  part.  Nonrandom  translations 
of  the  noise  have  no  effect  on  channel  information  or  capacity  so  we  can  assume  the  deterministic  part 
is  absent.  The  discontinuous  part  represents  the  process  at  the  points  at  which  it  is  not  stochastically 
continuous.  These  points  are  no  more  than  countable  in  number  [7]  and,  since  the  location  of  these 
points  are  part  of  the  specification  of  the  process,  can  be  avoided  by  channel  users.  Thus,  we  focus  on 
the  stochastically  continuous  part  of  the  independent  increment  noise  process  and  define  the  independent 
increment  noise  channel  model  with  L6vy  noise. 

The  Independent  Increment  Noise  Channel  Model 

The  independent  increment  noise  channel  is  an  additive  noise  model  Y,=X,+N,,  0 <t  <T  incor¬ 
porating  causal  feedback  (Figure  1).  The  process  Y  is  the  channel  output,  X  is  the  encoded  message, 
and  N  is  the  channel  noise.  W 8  and  Fr  denote  the  natural  histories  of  the  processes  0  and  Y ,  i.e„ 
F,9=ct[0j,O<j  <t]  and  F?=o[Y,,0£s  Sf]  for  t  e  [0.T],  We  fix  T>0  and  define 
F?x  =  a[Yt_z,0<s  Sr],  taking  Y,  =0  for  all  t  < 0.  FYx  is  the  history  of  Y  delayed  by  t.  The  encoded 
message  process  X  in  the  channel  model  is  specified  to  be  an  Fe^Fr  T-adapted  functional  of  Y  and  the 
message  process  0.  Thus  the  channel  model  includes  noiseless  delayed  causal  feedback  from  the  chan¬ 
nel  output  Y  and  nonanticipative  encoding  of  the  message  0.  We  do  not  consider  the  case  T=0  of 
instantaneous  feedback.  This  case  seems  impractical  and  raises  presently  unresolved  technical  questions. 
In  the  no-feedback  version  of  the  channel  model,  the  encoded  message  is  an  F8-adapted  functional 
X,  =  X,  (0)  of  only  the  message  0.  The  case  t  >  T  is  equivalent  to  the  no-feedback  case.  Also  as  part  of 
the  channel  model,  X  is  required  to  be  chdlhg  with  X0  =  O.  The  channel  noise  N  is  a  L6vy  process  and 
is  independent  of  0.  Following  (1),  we  write 

N,  =  W,  +  P,  (2) 

with  W0  =  P 0  =  0.  In  (2),  W,  is  a  zero-mean  Wiener  process  [6],  [12]  with  finite  continuous  variance 
measure  p.  By  this  it  is  meant  that  W,  has  continuous  paths  and  Gaussian  independent  increments 
Wt-  W(i  such  that,  for  all  0 <f1<r2<7\ 
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EIW,-  W.J  =  0.  El(W,r  W.f]  =  Pu/LfJ) 

where  p  is  a  finite  continuous  (w.r.t.  Lebesgue  measure)  measure  defined  on  ([0,7],LOT)).  Also  in  (2), 
P,  is  a  purely  discontinuous  [15]  (piecewise-constant)  cbdlag  process 

P,  =  jm  P,(dm)  (3) 

j 

where,  for  each  Fe  I;,  the  quantity  P,(F )  denotes  the  number  of  points  je  [0,/]  such  that 
Ps  -P,- oG  F  ■  Pi(‘)  is  a  marked  Poisson  point  process  with  mark  space  J  a  IR .  To  distinguish  the  nota¬ 
tions  P,  and  /’,(•).  we  remark  that  P,  is  a  jump  process  while,  for  instance,  P,(J)  is  a  counting  process 
which  gives  the  number  of  jumps  (irrespective  of  size)  of  P,  in  the  interval  [0,r].  It  follows  from  the 
Levy  nature  of  N  that  W  and  P  are  independent  processes.  Also,  for  71n72=0 ,  the  counting 
processes  P(F{)  and  P(F£  are  independent.  Because  P,(F)  is  Poisson  [6]  for  a  given  F  e  Zj,  it  has  a 
deterministic  compensating  measure  B  which  here  we  assume  lakes  the  form 

S([0,r]xf)  =  X6([0,t])p(F)  (4) 

for  some  A.>0  and  all  FeZj  and  t  e  [0,7].  The  set  functions  b  and  p  in  (4)  are  required  to  be  finite 
measures  on  ([0,7], £[o,r])  and  (7 Xj)  respectively.  The  measure  b  is  continuous  w.r.t.  Lebesgue  meas¬ 
ure.  (Both  b  and  P  are  continuous  as  a  consequence  of  the  stochastic  continuity  of  N .)  Finally,  both  p 
and  b  are  assumed  to  be  standardized: 

p(/)=l,  6  ([0,7])  =  7  . 

Looking  back  to  (1),  it  is  evident  that  deterministic  terms  have  been  arbitrarily  added  and  sub¬ 
tracted  to  produce  the  decomposition  of  N  in  (2).  We  maintain  that  this  does  no  harm  because  the  noise 
decomposition  in  (2)  is  used  only  for  calculating  channel  information  and  capacity  -  and  those  quanti¬ 
ties  are  unaffected  by  nonrandom  translations  of  the  noise. 

The  paths  of  the  L6vy  noise  process  N  in  the  channel  model  are  cbdlag.  Thus,  without  the  com¬ 
parable  restriction  (cbdlag  paths)  on  the  encoded  message  process  X,  the  encoder  could  communicate 
without  error  at  any  desired  rate  by  transmitting  non-cbdlbg  paths.  Hence,  for  finite  information  capa¬ 
city,  X  must  be  cbdlag.  Given  that  X  is  cadlag,  its  paths  can  only  have  discontinuities  of  the  first  kind 
[1]  and  these  are  countable  in  number.  Thus  it  is  meaningful  to  define 

=  £(*,-*,- o).  *£,  =X,  -XdJ. 

s*t 
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Both  X  and  N  are  chdlhg  so  Y  =X  +N  must  be  chdlag  as  well  and  we  define 


y</,  =  i^-u, 


Yc,  =  Y,-Y, 


d,i 


Analogous  to  (3),  we  can  write  Xd  as 


X<tj  =  \mXdj(dm) 
j 

where  the  marked  Poisson-type  point  process  Xdl(F)  counts  the  number  of  jumps  of  Xd  with  size  in 
F  e  Zj  in  the  interval  [0,/].  Likewise,  we  have 

Yd.,=jm  YdJ (dm). 


Conversely,  the  marked  point  processes  Xd()  and  Yd(-)  can  be  expressed  in  terms  of  their  jump  process 
counterparts  as  follows: 


XdJ(F)=  IlF(^,-^,-o) 

=  FeZy' 


Using  the  notation 
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for  the  message  process  0  in  the  independent  increment  noise  channel  model,  we  have 

Yc,=XcAQcJ)  +  Wt>  (5) 

YdJ  =  xdJ{Bd,Y)  +  rt.  (6) 


The  jump  process  channel  expressed  by  (6)  can  be  just  as  well  viewed  as  a  marked  Poisson  channel; 
that  is,  as  a  Poisson  channel  in  the  sense  of  [4],  [8]  in  which  marks  (the  jump  size)  are  associated  with 
each  jump  event  in  the  channel.  In  terms  of  the  marked  point  processes  Xd  ,(■),  P,(  ),  and  Yd  ,(■),  we 
write 

YdAF)  =  Xdd(Bd,YXF)  +  P,(F),  F  e  Zj  .  (7) 

The  two  formulations  (6)  and  (7)  arc  equivalent;  however,  (7)  has  the  advantage  of  being  applicable  in 
other  contexts  so  it  is  the  formulation  we  use. 


7 


The  continuous-path  part  Xc  of  the  encoded  message  is  assumed  to  take  the  form 

l 

Xc.i  =  l$sb(ds) 


(8) 


where  4>i  =  4>, (8C ,  K )  is  an  F  c  ^F>/-'c-prcdictabie  functional.  Also,  the  measurability,  integrabil ity,  and 
identifiability  conditions  laid  out  in  [10]  for  (5)  and  (8)  are  assumed  to  be  satisfied.  If  the  variance 
measure  P  of  W  in  (2)  is  absolutely  continuous  w.r.t.  b  and  if  a  mean-square  constraint 

rT 


JE 


J  *Mdt) 


<p 2 


(9) 


is  imposed  on  the  encoding  kernel  <j),  then  (5)  and  (8)  describe  a  form  of  the  additive  white  Gaussian 
noise  channel  treated  by  Kadota,  Zakai,  and  Ziv  [10].  Therefore,  to  use  their  results  for  information 
capacity,  we  assume  (9)  to  be  in  force  and  take  P «b.  In  fact,  it  is  assumed  that  P  and  b  are  sealed 
versions  of  one  another  P=ti6.  The  scalar  T|>0  is  termed  the  Wiener  noise  intensity  of  N,.  r\  is  the 
counterpart  to  the  Poisson  noise  intensity  X  introduced  in  (4). 

The  compensator  A  of  the  discontinuous  part  Xd  in  (6)  is  assumed  to  have  the  form 

t 

AaO,t]xF)  =  [jxs(mMdm)b(ds)  (10) 


where  the  (nonnegativc)  jump  intensity  x,(m)  is  a  IF**  ''F^-prcdicinble  functional,  p  is  the  finite 
measure  on  (7,Zy)  introduced  in  (4),  and  b  is  the  measure  already  appearing  in  (4)  and  (8).  The  meas¬ 
ures  b  in  (4),  (8)  and  1. 10)  need  not  have  been  chosen  to  be  identical;  however  >his  is  a  natural  choice. 
b  is  called  the  base  measure  of  the  independent  increment  noise  channel.  In  usual  applications,  b  is 
Ixbesgue  and  the  reader  uninterested  in  more  abstract  situations  is  invited  to  view  b  as  Lebesgue  for 
the  remainder  of  this  paper. 


Suppose  a  mean-square  constraint  similar  to  (9)  is  imposed  on  the  jump  intensity  x<(m): 

fr  l 


.oj 


$e2. 


(11) 


With  %  >n  (10)  satisfying  (11),  the  subchannel  model  in  (7)  is  recognized  to  be  a  straightforward  gen¬ 
eralization  of  the  Kabanov  Poisson-type  point  process  channel  model  [8]  with  a  mean-square  constraint 
in  place  of  the  peak  constraint  considered  there 


8 


The  mean-square  constraints  (9)  and  (11)  suggest  a  way  to  impose  a  single  mean-square  con¬ 
straint  on  the  encoder  output  X  by  a  formal  unification  of  0  and  X-  F°r  a'l  1  ^0,  define 


J  y.t  (m ) . 

lo¬ 


rn  e  J 
m  =0 


to  be  the  formal  intensity  of  the  c'adlhg  process  X  =XC  +  Xd  with  the  parts  Xc  and  Xd  as  in  (8)  and  (10). 
We  introduce  the  formal  mark  measure  v  =  p  +  S0  defined  in  the  obvious  way  on  the  whole  of  the  space 
M  -  J  u(0).  Here,  80  denotes  the  atomic  measure  which  assigns  all  of  its  unit  mass  to  the  point  0. 
Since  (i(/)=l,  v  gives  equal  weight  to  the  continuous  and  discontinuous  parts  of  the  channel.  Now  the 
two  constraints  (9)  and  (11)  are  combined  in  the  single  mean-square  constraint 

v 

E  VHm  )b  (di)  <  n2  (1 2) 

.  ou 


on  the  formal  intensity  V  of  the  encoded  message  process  X.  In  this  context,  we  call  the  LHS  of  (12) 
the  formal  power  of  the  encoder  output  and  I"l2  the  formal  power  available  to  the  encoder. 

To  summarize,  the  independent  increment  noise  channel  model  is  given  by  (5)  and  (7)  with  Levy 
noise  N  in  (2).  The  noise  variance  (?  measure  is  related  to  the  channel  base  measure  by  p  =  qb  for  some 
T|>0.  The  noise  compensating  measure  B  is  correspondingly  assumed  to  take  the  form  for  some 
X>0.  The  encoded  message  X  is  assumed  to  be  cadlag  with  /F9'' IF  ^-predictable  (X > 0)  formal  inten¬ 
sity  4*  sadofying  the  mean-square  constraint  (12).  The  channel  base  measure  h  is  assumed  to  be  con¬ 
tinuous  with  h([0,T])  =  7  while  the  formal  mark  measure  v  has  an  atom  of  unit  mass  at  zero  and  unit 
mass  (without  atoms)  elsewhere: 

v([0))=I.  v(7 )  =  1  . 


III.  CHANNE'.  CAPACITY 


In  this  section  the  information  capacity  of  the  independent  increment  noise  channel  is  given  and 
its  derivation  is  outlined.  Proofs  of  results  used  in  the  derivation  are  given  in  the  next  section.  We  begin 
by  recalling  the  definitions  of  channel  information  and  channel  capacity. 


Channel  information  is  defined  to  be  the  average  mutual  information  [14]  in  the  message  9  and 
the  channel  output  Y  over  the  interval  [0,7]: 


IT[Q,Y]  =  E 


d  Jior 

In- - 

d[lo,r  J 


provided  otherwise  /r[0,Y]=°°.  Here,  p0r,  Po,  and  pr  arc  the  joint  and  marginal  measures 

induced  by  the  message  and  output  processes,  9  and  Y,  on  the  spaces  So,  Sr,  and  So*. Sr  where  So  and 
Sr  are  the  spaces  of  trajectories  of  0  and  Y  over  the  interval  [0,7].  Also,  po*r  is  the  product  measure 
induced  by  p«  and  pr.  The  information  capacity  of  the  independent  increment  noise  channel  is  the 
supremum  ot  the  channel  information 

€  =  SU0P  sup  I iT[Q,Y\ 


where  0  is  any  jointly  measurable  process  defined  over  the  interval  [0,7]  and  X  =  X(0,Y)  is  any  cadlag 
F0''/Fr  T-adapted  message  encoding  satisfying  the  mean-square  constraint  (12)  on  its  formal  intensity. 

The  independent  increment  noise  channel  with  delayed  feedback  considered  here  is  mcmoryless  in 
the  sense  of  Kadota  in  [9]  and  Kadota,  Zakai,  and  Ziv  in  [1!|.  In  [11]  ii  is  shown  that  the  information 
capacity  of  a  continuous-time  memoryless  channel  can  not  be  increased  by  feedback.  Thus,  for  the 
present  purpose  of  obtaining  the  information  capacity  of  the  independent  increment  noise  channel  with 
x>0,  we  can  and  do  restrict  our  attention  to  the  no-fccdback  case.  This  significantly  simplifies  the 
problem.  In  particular,  we  exploit  the  facts  that  in  the  absence  of  feedback  the  processes  X  and  N  arc 
independent  and  that  Yc  and  Yd  arc  independent  if  and  only  if  the  components  0j  and  0C  of  the  mes¬ 
sage  are  independent. 

The  parallel  Gaussian  and  marked  Poisson  subchannels  of  the  independent  increment  noise  chan¬ 
nel  dominate  its  structure.  In  the  absence  of  feedback,  these  two  subchannels  have  two  points  of 
interaction:  the  transmitted  message  components  Qc  and  Gd  may  be  dependent  and  the  encoders  of  the 
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two  subchannels  compete  for  the  available  formal  power  Id2.  Thus,  the  independent  increment  noise 
channel  is  distinguished  by  the  possibility  of  redundantly  encoding  some  or  all  of  the  message  to  be 
transmitted  into  both  Xd  and  Xc.  It  is  not  immediately  clear  what  effect  such  redundancy  has  on  the 
combined  capacity  of  the  two  subchannels.  This  question  is  settled  with  the  aid  of  some  definitions. 

Let  the  combined  message  and  output  of  n  additive  noise  channels  be  expressed  by 

0  =  (0, ,  02, ....  0„ )  and  Y  =(Tj,  )-'2,...,  T„),  respectively.  Also,  define  N  =(A';,A'; . ,V„)  and 

X  =(X  i,X2,~,X„).  The  parallel  combination  channel  (Q.X  ,N  ,Y)  with  n  subchannels  is  defined  to  have 
the  following  properties: 

1.  Yij  =Xt  l  +  NtJ  ,  /  e[0,7  ],  i  =  l,2,...,n  , 

2.  XtJ  =  X,  ,(d,)  is  /f8, -adapted  ,  i  =  1,2 . n  , 

3.  0 ,  N  are  independent, 

4.  N, ,  iV2,  ....  A/,  are  mutually  independent. 


Disregarding  the  mean-square  constraint  on  the  formal  intensity,  we  observe  that  the  independent  incre¬ 
ment  noise  channel  without  feedback  is  a  parallel  combination  channel  with  two  subchannels. 

The  capacity  of  the  parallel  combination  channel  is 


C  = 


sup 

86/;, 


sup 
.V,  6  A- 
XeC 


Vr[e,n 


where  B]  is  the  class  of  n -tuples  (0i,...,9„)  of  jointly  measurable  processes  and  A,,  i  =  1,2 . n  is  the 
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class  of  IF  ‘-adapted  encodings.  Also,  C  is  a  class  of  encodings  satisfying  a  fixed  collection  of  con¬ 
straints;  for  the  independent  increment  noise  channel.  C  is  the  class  of  cadlag  processes  with  formal 
intensity  satisfying  (12).  It  is  clear  that  the  capacity  of  the  parallel  combination  channel  is  at  least  as 
great  as  the  sum  of  the  capacities  of  the  individual  subchannels.  The  following  theorem  states  that  noth¬ 
ing  is  gained  by  redundantly  encoding  some  part  of  the  message;  allowing  components  of  0  to  be 
dependent  does  not  increase  capacity.  The  theorem’s  proof  is  given  in  the  next  section. 


Theorem  1:  Let  (8  ,X  ,N  ,Y)  be  a  parallel  combination  channel  with  2  subchannels  (0,,,Y1,A/1,)'1) 
and  (92,Y2,  N2,  Y^).  The  capacities  of  the  two  subchannels  arc 

1 


7/T i0.^. 
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^Cxl)  =  ^Xj6gPnA27/7'(02-yJ 

e 

where  B  is  the  class  of  jointly  measurable  processes,  A,,  t=l,2  is  the  class  of  F  ‘-adapted  encodings, 
CX{  is  the  set  of  all  encodings  X  e  C  with  first  component  X ,  fixed,  and  CXl  is  the  set  of  all  encodings 
X  6  C  with  second  component  X2  fixed.  Then 


Similar  statements  can  be  made  for  parallel  combination  channels  with  more  than  2  subchannels. 
In  terms  of  our  channel  model.  Theorem  1  states  that  the  information  capacity  of  the  independent  incre¬ 
ment  noise  channel  is 

c“^n*K(/?)  +  c"(e)}  (13) 

where  Cw(R)  is  the  capacity  of  the  Gaussian  channel  (5),  (8)  with  Wiener  noise  W  and  mean-square 
constraint  parameter  R  in  (9)  and  where  €/>(Q)  is  the  capacity  of  the  marked  Poisson  channel  (7),  (10) 
with  marked  Poisson  noise  P  and  mean-square  constraint  parameter  Q  in  (11).  The  capacity  CW(R)  is 
known  [10];  with  or  without  feedback  it  is 

Cw(/?)=C'  °4) 

Therefore,  to  obtain  the  capacity  of  the  independent  increment  noise  channel,  it  only  remains  to  deter¬ 
mine  the  capacity  of  the  marked  Poisson  channel  with  mean-square  constraint  (11)  and  to  then  find  the 
supremum  in  (13). 

The  following  theorem  gives  the  capacity  of  the  marked  Poisson  channel.  The  proof  is  given  in 
the  next  section. 

Theorem  2:  Consider  the  marked  Poisson  channel  model  (7)  with  signal  compensator  (10)  and 
noise  compensator  (4).  Suppose  that  the  channel  base  and  mark  measures  b  and  p  arc  both  finite  and 
continuous  and  let  the  encoder  intensity  x,(m)  be  IF1  ~IF  a -predictable  and  mean-square-constrained  as 
in  (11)  for  some  Q  SO.  Then  Cp(Q)  =  D(\,Q)  where  D(0,Q)  =  2Q/e  and,  for  \>0, 
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D(k,Q) 


Xa2 


*■ 

P 2 

p 2 

1  +  a 

»  •> 

|ln(l  +a)  -  X 

aX2 

In 

aX2 

<  J 

where  a  is  the  unique  solution  of  the  equation 

r 

P 2 


In 


aX2 


■+ 1 


+  2  = 


1  +  1 
a 


!n(  1  +  a). 


(15) 


The  information  capacity  given  in  Theorem  2  is  the  instantaneous  feedback  capacity.  When  the 
proof  of  this  result  is  examined,  however,  it  is  seen  that  feedback  is  not  used  in  its  derivation.  Thus 
(15)  is  also  the  capacity  for  delayed  feedback  (t  >0)  and,  in  particular,  for  the  case  in  which  the 

g 

encoder  intensity  (m )  is  restricted  to  be  F  d -predictable  -  the  no-feedback  case.  Causal  feedback  with 
or  without  delay  cannot  increase  capacity  in  the  marked  Poisson  channel  considered  here. 

With  regard  to  Theorem  2,  we  note  that  although  the  theorem  is  presented  for  a  mark  space 
J  C.1R  of  jumps,  one  obtains  the  same  result  for  more  general  mark  spaces  -  provided  the  associated 
mark  measure  is  finite. 

The  following  theorem  follows  from  (13)  using  Theorem  2  and  (14). 

Theorem  3:  Fix  t>0.  The  information  capacity  of  the  independent  increment  noise  channel  is 
C  =  r(M,n)  where 

ra,ri.n)  =  0ss^n|^2!  +  D(X.Q)j.  (16) 


The  supremum  in  (16)  does  not  in  general  admit  analytic  evaluation  for  X>0.  However,  for  X=0, 
f  can  be  more  simply  expressed.  Using  D(0,Q)  =  2Qle ,  the  supremum  in  (16)  is  found  to  be 


rxo.Ti.ri)  =  i  -J 


rr  2q 

7v  +  -2 


-n, 

e 


n>*t 

n<^ 

e 


(17) 


r(0,Ti,n)  is  presented  as  a  function  of  n  in  Figure  2.  For  comparison,  the  corresponding  Gaussian  and 
Poisson  subchannel  capacities  nz/(2r|)  and  2fl/e  are  also  shown. 
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Figure  2.  Information  capacity  <C  of  the  independent 
increment  noise  channel  with  X  =  0. 
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The  expression  for  r(0,T|,n)  in  (17)  exhibits  a  transition  point  at  2 q/e .  This  point  is  obtained  by 
solving  for  the  zero  of  the  derivative  of 

+ 1 
2r|  e 

with  respect  to  Q.  For  n<2q/e,  to  maximize  channel  capacity,  one  does  best  to  allocate  all  available 
formal  power  to  the  marked  Poisson  subchannel.  For  n  >2q/e ,  the  formal  power  should  be  apportioned 
between  the  two  subchannels  with  formal  power  equal  to  (2q/e)2  assigned  to  the  marked  Poisson  sub¬ 
channel  and  the  remainder  to  the  Gaussian  subchannel.  For  maximum  information  rate  in  the  channel, 
at  least  some  formal  power  must  be  assigned  to  the  Poisson  subchannel  in  all  cases  where  X=0. 

The  optimal  distribution  of  the  formal  power  172  among  the  two  subchannels  of  the  independent 
increment  noise  channel  for  the  case  X>0  can  be  addressed  numerically,  affording  further  insight  to  the 
nature  of  the  channel.  Numerical  calculations  involving  r  are  somewhat  simplified  by  the  fact  that  T  is 
first-order  homogeneous;  i.e.  r(wx,wy,wz)  =  ivr(x,v,z)  for  all  w  >0.  The  results  of  calculations  made 
for  the  optimal  distribution  of  formal  power  arc  displayed  in  Figure  3  for  a  range  of  X/n  and  q/n. 
There  are  three  possible  forms  the  optimal  distribution  might  take:  assignment  of  all  of  n2  to  the  Pois¬ 
son  subchannel,  assignment  of  all  of  n2  to  the  Gaussian  subchannel,  or  a  nontrivial  apportionment  of 
n2  to  both  subchannels.  According  to  Figure  3,  all  three  possibilities  do  occur  for  the  independent 
increment  noise  channel. 

Figure  3  shows,  in  particular  that  for  X>q,  the  optimal  distribution  of  formal  power  is  to  assign 
all  of  n2  to  the  Gaussian  subchannel.  Thus,  in  this  case,  the  capacity  of  the  independent  increment 
noise  channel  is  simply 

e=-£-,  x  >  q .  (18) 

A  proof  of  (18)  is  given  in  outline  as  follows:  Hold  X  and  q  fixed  with  X>q>0.  Let  g(Q)  and  d(Q) 
denote  the  first  partial  derivatives  with  respect  to  Q  of  Q2/(2q)  and  D(X,(X),  respectively.  The  second 
derivative  of  D(\,Q)  is  found,  upon  evaluation  at  Q  =  0,  to  be  equal  to  1.  Because  D(k,Q)  is  first- 
order  homogeneous,  it  then  follows  that  d'(0)=l/X.  Also,  d\Q)  is  a  decreasing  function  of  Q. 
g'(Q)=  1/q  so,  for  X>q  and  all  Q  >0,  g'(Q)Zd'(Q).  We  have  g(0)  =  d(0)  =  0  so  g{Q)>d(Q)  for  all 
Q  >0.  Then  the  supand  in  (16)  is  nonincrcasing  in  Q  and  the  optimal  choice  of  Q  is  Q  =0.  (18)  fol¬ 


lows. 


n 


Figure  3.  Optimal  assignment  of  formal  power,  n2.  Three  regions 
shown  as  functions  of  the  channel  noise  parameters,  { X.,  tj  }  scaled 
by  FI.  The  dotted  line  is  the  contour  along  which  each  subchannel 
is  optimally  assigned  half  the  available  formal  power. 
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For  X/t)  <  1,  Figure  3  shows  that  the  optimal  distribution  of  n2  follows  the  general  scheme 
observed  for  the  special  case  X  =  0;  for  IT/t\  below  some  transition  point  (which  varies  as  a  function  of 
X/rj)  one  optimally  uses  only  the  Poisson  subchannel  and  above  this  transition  point  one  optimally  uses 
both  subchannels.  This  point  is  illustrated  in  Figure  4  where  the  channel  capacity  C  =  r(X,T|,  17 )  is  plot¬ 
ted  for  the  representative  case  X/T[=0.1.  The  similarity  with  the  plot  of  T^.ri.n)  in  Figure  2  is  evi¬ 
dent. 

The  capacity  formula  (17)  is  given  for  the  independent  increment  noise  channel  with  delayed 
causal  feedback  even  though  (17)  was  only  derived  for  the  case  of  message  encoding  without  feedback. 
As  has  already  been  observed,  it  follows  from  the  result  of  Kadota,  Zakai,  and  Ziv  in  [11]  that  the 
information  capacity  is  the  same  in  the  two  cases  of  delayed  feedback  and  no  feedback. 

A  quantity  of  significant  practical  interest  for  a  channel  is  the  coding  capacity  [2],  [17].  Briefly,  it 
is  the  supremum  of  information  rates  -  the  logarithm  of  the  number  of  messages  of  length  T  divided  by 
T  -  for  which  the  error  probability  can  be  made  arbitrarily  small  for  sufficiently  large  T .  The  coding 
capacity  Ccodcng  °f  the  independent  increment  noise  channel  follows  in  straightforward  fashion  from 
Theorem  3.  For  the  present  channel  model  with  x>0,  Ccodbjg =  T(X,  t]  ,  IT)  also.  Skipping  the  details,  this 
is  proved  first  by  showing  that  the  coding  capacity  of  the  marked  Poisson  channel  in  Theorem  2  has  the 
same  expression  as  that  given  for  the  information  capacity.  An  argument  similar  to  Wvner’s  [18]  for  the 
peak-constrained  Poisson  channel  proves  this.  Combined  with  the  known  coding  capacity  [2]  of  the 
additive  Gaussian  white  noise  channel,  this  gives  that  Ccoon>ro^r(X  11,11).  Fano’s  inequality  completes 
the  proof. 


IV.  PROOFS 


This  section  is  divided  into  two  subsections.  In  the  first,  a  result  for  the  information  in  the  parallel 
combination  channel  is  obtained,  leading  to  a  proof  of  Theorem  1.  The  second  subsection  contains  a 
proof  of  Theorem  2. 

Capacity  of  the  Parallel  Combination  Channel 

In  this  subsection  we  show  that  the  capacity  of  the  parallel  combination  channel  is  the  sum  of  the 
capacities  of  its  n  subchannels.  Repeated  use  is  made  of  Kolmogorov’s  formula  [13].  Let  Zj,  Z2,  Z3, 
and  Z4  be  four  random  elements  defined  on  a  common  probability  space.  Kolmogorov’s  formula  is 

I[(ZU  Z,).Z3]  =  /  [Z,, Z3]  +  E  [I  [Z2,  Z3 1 Z,]] .  (19) 

When  generalized  to  average  conditional  informations,  Kolmogorov’s  formula  takes  the  form  [13] 

£[/[(Z1,Z2),Z3IZ4]]  =  £[/[Z„Z3IZ4]]  +  £[/[Z2,Z3l(Z1,Z4)]] .  (20) 

Multiinfoimation  [15]  appears  naturally  in  the  following  discussion.  The  multiinformation 

M [Zi . Z„]  in  n  random  elements  ZX,...ZK  is  defined  as  a  relative  entropy  in  [15].  An  equivalent 

definition  is 

A/[Z,,...,Z„]  =  t/[(Z, . Z*_,),Zt].  (21) 

k=2 

Multiinformation  is  an  extension  of  average  mutual  information;  for  two  random  elements 
A/[Z1,Z2]=/[Z1,Z2].  Like  average  mutual  information,  multiinformation  is  nonnegative  and  zero  if  and 
only  if  the  corresponding  random  elements  are  mutually  independent. 

Lemma  1 :  Let  (Q,X,N ,Y)  be  a  parallel  combination  channel  with  n  subchannels.  Then,  for  all 

i  *j< 

ITm.Bj),Yj]=IT[QJ,YJ], 


rTm,Yi),Yj)  =  iT[Qi,Y,  1, 
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;r[(et.(y1.y>)]  =  /r[01,rI]. 

Proof.  We  prove  the  first  equality;  the  others  are  proved  similarly.  9,  and  Nj  are  independent  so 

/r[9I,y>  19;]  =  iT[ei,Xj(Qj)+Nj  iey]  =  o. 

Then,  by  Kolmogorov’s  formula  (19), 

Ir CO, , 0 j ),Yj ]  =  IT [0, ,Yj ]+E[IT [0,- ,  Yj  |0; ]] 

=  ITtfj,Yj]. 

Proposition  1:  Let  (Q,X  ,N ,Y)  be  a  parallel  combination  channel  with  n  subchannels.  Then, 
IT[Q,Y)  =  -  AfT[y„y2 . yn) 

i=i 

where  MT[Y  X,Y2,  ...,YK]  is  the  multiinformation  in  the  n  subchannel  outputs  over  the  interval  [0,7*]. 

Proof:  Consider  n  =2.  From  (19), 

/r  [(0i,  e^i.  y*)]  =  /r[0l,(y1,y2)]  +  £[/r[02.(yi,y2)i0.]]- 

By  Lemma  1  above,  [0i,(y T, y^] =/r  [Qi,  yx].  Then,  using  (20), 

/r[(0i,02),(yi.y2)i  =  /r[0i,yi] 

+  £[/r[02,r1i01]] 

+  h  [/r[02,  y  2K©i,y  i)]]  • 

Also 

/r[02,y2i(0„y,)]  =  /r[02,y2i(0i,/v,)]  =  /r[02,y2i0,] 

since  Y2  is  conditionally  independent  of  given  (01,02).  So 

/r[(0i,X2).(0i.y2)]  =  /r[0i,yi] 


+  £[/r[0i,y2iyl]] 

+  £[/r[02,  y2i0,]]. 
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Kolmogorov’s  formula  is  used  twice  more  to  obtain 

£[/7'[02,l,2l01]]=/r[(01.02).^2]-^7'[01.y2]. 

£[/r[91,y2iyI]]  =  /r[(0i,r1),y2]  -it[yuya. 

Then,  using  the  preceding  lemma,  we  get 

/r[(0i.02).(yi,}'2)]=/rC0i.>,1] 

-t-  /rce2,  yj  -  /T[0i,r2] 

+  /r[0i,y2]-/T[>,i.>,2] 

which  is  the  desired  result  for  n=  2.  Similarly,  for  n  =k  one  can  show  that 

iT  t(0i . 0*  Wi Yt)]  =  iT  [(0i , ....  9t-i)((y  i, ....  n_i)i 

+  /r  [9t ,  y*  ]  -  /T  [(y  „ y*_j),  r*  ] . 

Then,  by  recursion  and  the  relation  (21)  the  result  is  obtained  for  ail  n .  Therefore,  the  proof  is  com¬ 
plete. 


Since  multiinformation  is  always  nonnegative,  Proposition  1  implies  that  for  parallel  channels 
without  feedback,  the  channel  information  in  the  composite  channel  is  dominated  by  the  sum  of  the 
informations  in  the  component  subchannels.  Proposition  1  quantifies  the  degree  to  which  the  composite 
mutual  information  is  dominated  by  the  sum  of  its  component  informations  -  according  to  Proposition  1, 
the  sum  of  the  component  informations  exceeds  the  composite  information  by  exactly  the  multiinforma¬ 
tion  Afr[y1,..., y„]  in  the  channel  output.  Since  the  outputs  Yl . Yn  of  the  parallel  combination  chan¬ 

nel  are  mutually  independent  iff  8i,...,8„  are  mutually  independent,  the  proposition  further  implies  that 
the  composite  information  equals  the  sum  of  the  component  informations  if  and  only  if  the  components 
of  the  input  are  mutually  independent  We  state  this  formally  as  a  corollary  from  which  Theorem  1  fol¬ 
lows  directly. 

Corollary.  Let  (8,X ,N ,Y)  be  a  parallel  combination  channel  with  n  subchannels.  Then, 

/T[0,n<t/7'[0i,yi] 

<=i 
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with  equality  iff  0lf ....  9,  are  mutually  independent.  Also, 

<c=£c, 

;=i 

where  C  is  the  capacity  of  the  parallel  combination  channel  and  C,  is  the  capacity  of  its  i  th  subchan¬ 
nel. 


The  corollary  implies  that,  in  seeking  the  capacity  of  the  parallel  combination  channel,  one  need 
only  consider  messages  0  such  that  0l,02, ...,0„  are  mutually  independent. 

The  foregoing  calculations  are  similar  to  some  performed  by  Kadota  [8]  for  time  segments  of  the 
continuous-time  memoryless  channel. 

Capacity  of  the  Marked  Poisson  Channel 

The  proof  of  Theorem  2  relies  on  results  given  in  [5]  for  the  capacity  of  the  Poisson  channel 
(unmarked)  with  a  mean-square  constraint.  These  results  are  repeated  here  as  Lemmas  2  and  3. 

Lemma  2:  Fix  Q  SO,  A.S0  and  let  Aq  denote  the  set  of  all  nonnegative  random  variables  X  with 
constrained  second  moment  E[X2]<j22-  Also,  / JX]  =E[XlnX]-E [X ]ln£[X).  Then 

xS"PAQnX  +  \]  =  D(X,Q). 


Lemma  3:  Consider  the  Poisson  channel  with  Poisson-type  point  process  channel  output  Y,, 
t  g  [0,7]  with  compensator 

t 

j(Xr  +*)b(ds) 

0 

where  X>0  is  the  noise  intensity,  b  is  the  channel  base  measure  (b[0,7]  =  r),  and  is  a 

predictable  encoding  of  the  message  process  9,,  t  e  [0,T],  Let  the  encoder  intensity  %,  be  mean- 
square-constrained,  E [yj]  < <2 2  for  all  r  g  (0,7' ].  The  information  capacity  of  this  channel  is  D(k,Q). 
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Lemma  4:  Let  D  be  a  Borel  subset  of  1R  and  define  AD  to  be  the  class  of  random  variables  with 
range  in  D  .  Let  /  be  a  real  function  and  suppose  /-!,  the  inverse  of  / ,  exists  on  D .  For  P  e  D ,  define 
A  =  [X  e  AD:  E[f  (X)]£f  (P)}.  Let  g  be  a  real  function  such  that  g°f~l  is  concave  and  nondecreasing. 
Then 

J?™E[g(X)]  =  g(P). 


Proof'.  Define  h  =  g°f~]-  Then 

E  (g  (X  ))  =  E[k(f  (X  ))]<  h(E  If  (X )))  <  h  {/  (P  ))  =  g  (P ) . 

Let  X  =P .  Then  X  e  A  and  £[?(A')]  =  g(F).  The  result  follows. 

Lemma  5:  Consider  the  Poisson  channel  of  Lemma  3  except,  instead  of  the  mean-square  con¬ 
straint  imposed  there  on  the  encoder  intensity,  let  the  encoder  constraint  be 

T 


E[xi)b(dt)  <,Q2. 


(22) 


The  capacity  is  still  C  =  D(X,Q). 


Proof:  The  channel  information  is  [12] 


/T[0,y]  =  |E  fa,  lm],  -  f|,lmi(]6(<10. 


Here  T|,  ~x<  +  ^  and  "Hr  is  the  predictable  version  of  E[t|,  I Fj).  By  Jensen’s  inequality. 


lT[Q,Y]  <^[x,+X]b(dt). 


By  Lemma  2, 1[X  +A]  has  the  upper  bound  D(X,E  a[X2]).  Therefore 

r 


/rte,y]  s|d(A,Vm(7))6(^) 


where  M(t)  =  E ft,2].  Thus 


C  *  J  « J*> <*.  <MTT))b (di ) 


(23) 
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where  <X>  is  the  class  of  nonnegative  functions 

T 

<D=  [M (t): M (t)>0 ,  ~j\f(t)b(dt)ZQ2}. 

*  o 

The  function  D(k,fx)  is  nonnegative,  increasing,  and  concave  as  a  function  of  x.  Also,  b  is  finite  so, 
by  Lemma  4,  the  supremum  in  (23)  is  realized  for  M  (/)  equal  to  a  constant.  Thus  we  have 

C<±jD(\,JQl)b(dt)  =  D(k,Q). 

1  0 

The  constraint  E[x?],=  Q2  for  all  /  €  [0,7]  is  stronger  than  the  constraint  (22)  so  C>D(X,Q ),  also. 
This  completes  the  proof. 


Proof  of  Theorem  2:  We  begin  just  as  in  the  proof  of  Lemma  5.  The  channel  information  in  the 
marked  Poisson  channel  is 


J|£[Tl,(m)lnTi,(OT)  -  ■n,(m)lnfi,(ffi)]b(d/Xt(^); 


this  expression  is  easily  inferred  from  that  given  in  [12]  for  the  Poisson  channel.  Here 
Tlr ("0  =  X»('n)  + A.  and  fj,(/n)  is  the  predictable  version  of  £[Ti,(m)llF,r].  By  Jensen’s  inequality. 


lT [0,  Y }  <  J|/ [x, (m )  +  X]b (dt  )\i(dm ) . 


By  Lemma  2,  /[X  +X]  has  the  upper  bound  D(\,E'a[X2]).  Therefore 

T 

JJJ 

J  0 


IT  [0,  Y]  <  j  j  D  (X,  V M(t,m))b  ( dt  Mdm ) 


where  M(t  ,m)  =  E  \x?(m )].  Thus 


C  <  -J- c  ^  J  J D  a,  )h  (« ±  V  y(dm ) 

1  ’  7  0 

where  <1>  is  the  class  of  nonnegative  functions 

<D=  ]-\\M(t ,m)b(dt)\i(dm)<Q2) . 

*  70 


(24) 
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The  function  D(X,  V 3F)  is  nonnegative,  increasing,  and  concave  as  a  function  of  x.  Also,  b  and  p.  are 
finite  so,  by  Lemma  4,  the  supremum  in  (24)  is  realized  for  M (/  ,m)  equal  to  a  constant.  Thus  we  have 

1  7 

C<  ±\\D(k,<Q1)b(dt)\i{dm)=D(}.,Q).  (25) 

1  JO 

We  now  show  that  the  RHS  of  (25)  is  also  a  lower  bound  on  C.  Let  the  marks  on  the  channel 
output  be  ignored;  i.e.,  process  the  output  Y  of  the  marked  Poisson  channel  being  considered  here  as 
follows: 

(y,(F),F6l;)->W). 

In  other  words,  only  the  path  Y (J)  of  the  channel  output  is  retained.  In  such  cases 
/r[9, K] S/r[0,  Y (J)]  so  €  is  lower-bounded  by  the  capacity  of  the  Poisson  channel  (unmarked)  with 
channel  output  Y(J).  This  second  channel  has  noise  and  encoder  intensities 

=  X,  jxt(m)\i(dm) . 

Suppose  the  encoder  intensity  (m )  is  chosen  to  be  a  function  Xt=Xi(m)  °f  only  t  -  not  of  m.  Then 
we  have  a  Poisson  channel  with  noise  intensity  X  and  encoder  intensity  x<  with  constraint 

,  t 

±[E[X?]b(dt)  <  Q2. 

1  o 

Therefore,  by  Lemma  5,  C >D (X,Q)  and  the  theorem  is  proved. 
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